
    High-throughput sequencing reveals a simple model of nucleosome energetics

    We use nucleosome maps obtained by high-throughput sequencing to study sequence specificity of intrinsic histone-DNA interactions. In contrast with previous approaches, we employ an analogy between a classical one-dimensional fluid of finite-size particles in an arbitrary external potential and arrays of DNA-bound histone octamers. We derive an analytical solution to infer free energies of nucleosome formation directly from nucleosome occupancies measured in high-throughput experiments. The sequence-specific part of free energies is then captured by fitting them to a sum of energies assigned to individual nucleotide motifs. We have developed hierarchical models of increasing complexity and spatial resolution, establishing that nucleosome occupancies can be explained by systematic differences in mono- and dinucleotide content between nucleosomal and linker DNA sequences, with periodic dinucleotide distributions and longer sequence motifs playing a secondary role. Furthermore, similar sequence signatures are exhibited by control experiments in which genomic DNA is either sonicated or digested with micrococcal nuclease in the absence of nucleosomes, making it possible that current predictions based on high-throughput nucleosome positioning maps are biased by experimental artifacts. Comment: 36 pages, 13 figures
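
The abstract above describes fitting free energies to a sum of energies assigned to individual nucleotide motifs. As a rough illustration only, the sketch below fits per-sequence energies to mono- and dinucleotide counts with ordinary least squares. Least squares is a technique swapped in for illustration, not necessarily the authors' fitting procedure, and the names `motif_counts`, `fit_motif_energies` and `predict_energy` are hypothetical. The energies are assumed to be already available, one per sequence window.

```python
import numpy as np
from itertools import product

BASES = "ACGT"
# Feature order: 4 mononucleotides followed by 16 dinucleotides.
MOTIFS = list(BASES) + ["".join(p) for p in product(BASES, repeat=2)]


def motif_counts(seq):
    """Count overlapping occurrences of each mono- and dinucleotide in seq."""
    counts = np.zeros(len(MOTIFS))
    for i, motif in enumerate(MOTIFS):
        k = len(motif)
        counts[i] = sum(seq[j:j + k] == motif for j in range(len(seq) - k + 1))
    return counts


def fit_motif_energies(seqs, energies):
    """Least-squares fit of E(seq) ~ sum_m count_m(seq) * eps_m + const.

    Returns one coefficient per motif plus a trailing constant offset.
    The motif counts are linearly dependent, so the individual
    coefficients are not unique; lstsq returns the minimum-norm solution.
    """
    X = np.array([motif_counts(s) for s in seqs])
    X = np.hstack([X, np.ones((len(seqs), 1))])  # constant offset column
    coef, *_ = np.linalg.lstsq(X, np.asarray(energies, dtype=float), rcond=None)
    return coef


def predict_energy(seq, coef):
    """Energy predicted for seq from fitted coefficients."""
    return float(np.append(motif_counts(seq), 1.0) @ coef)
```

Because the counts are linearly dependent (for a fixed window length the mononucleotide counts always sum to that length), only the predicted energies are well determined in this sketch, not each motif's individual coefficient.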

    Improved annotation of 3' untranslated regions and complex loci by combination of strand-specific direct RNA sequencing, RNA-seq and ESTs

    The reference annotations made for a genome sequence provide the framework for all subsequent analyses of the genome. Correct annotation is particularly important when interpreting the results of RNA-seq experiments where short sequence reads are mapped against the genome and assigned to genes according to the annotation. Inconsistencies in annotations between the reference and the experimental system can lead to incorrect interpretation of the effect on RNA expression of an experimental treatment or mutation in the system under study. Until recently, the genome-wide annotation of 3-prime untranslated regions received less attention than coding regions and the delineation of intron/exon boundaries. In this paper, data produced for samples in Human, Chicken and A. thaliana by the novel single-molecule, strand-specific, Direct RNA Sequencing technology from Helicos Biosciences, which locates 3-prime polyadenylation sites to within +/- 2 nt, were combined with archival EST and RNA-Seq data. Nine examples are illustrated where this combination of data allowed: (1) gene and 3-prime UTR re-annotation (including extension of one 3-prime UTR by 5.9 kb); (2) disentangling of gene expression in complex regions; (3) clearer interpretation of small RNA expression and (4) identification of novel genes. While the specific examples displayed here may become obsolete as genome sequences and their annotations are refined, the principles laid out in this paper will be of general use both to those annotating genomes and those seeking to interpret existing publicly available annotations in the context of their own experimental data. Comment: 44 pages, 9 figures

    Nucleolar Association and Transcriptional Inhibition through 5S rDNA in Mammals

    Changes in the spatial positioning of genes within the mammalian nucleus have been associated with transcriptional differences and thus have been hypothesized as a mode of regulation. In particular, the localization of genes to the nuclear and nucleolar peripheries is associated with transcriptional repression. However, the mechanistic basis, including the pertinent cis-elements, for such associations remains largely unknown. Here, we provide evidence that demonstrates a 119 bp 5S rDNA can influence nucleolar association in mammals. We found that integration of transgenes with 5S rDNA significantly increases the association of the host region with the nucleolus, and their degree of association correlates strongly with repression of a linked reporter gene. We further show that this mechanism may be functional in endogenous contexts: pseudogenes derived from 5S rDNA show biased conservation of their internal transcription factor binding sites and, in some cases, are frequently associated with the nucleolus. These results demonstrate that 5S rDNA sequence can significantly contribute to the positioning of a locus and suggest a novel, endogenous mechanism for nuclear organization in mammals.

    Silent chromatin at the middle and ends: lessons from yeasts

    Eukaryotic centromeres and telomeres are specialized chromosomal regions that share one common characteristic: their underlying DNA sequences are assembled into heritably repressed chromatin. Silent chromatin in budding and fission yeast is composed of fundamentally divergent proteins that assemble very different chromatin structures. However, the ultimate behaviour of silent chromatin and the pathways that assemble it seem strikingly similar among Saccharomyces cerevisiae (S. cerevisiae), Schizosaccharomyces pombe (S. pombe) and other eukaryotes. Thus, studies in both yeasts have been instrumental in dissecting the mechanisms that establish and maintain silent chromatin in eukaryotes, contributing substantially to our understanding of epigenetic processes. In this review, we discuss current models for the generation of heterochromatic domains at centromeres and telomeres in the two yeast species.

    Structural basis of RNA polymerase III transcription initiation.

    RNA polymerase (Pol) III transcribes essential non-coding RNAs, including the entire pool of transfer RNAs, the 5S ribosomal RNA and the U6 spliceosomal RNA, and is often deregulated in cancer cells. The initiation of gene transcription by Pol III requires the activity of the transcription factor TFIIIB to form a transcriptionally active Pol III preinitiation complex (PIC). Here we present electron microscopy reconstructions of Pol III PICs at 3.4-4.0 Å and a reconstruction of unbound apo-Pol III at 3.1 Å. TFIIIB fully encircles the DNA and restructures Pol III. In particular, binding of the TFIIIB subunit Bdp1 rearranges the Pol III-specific subunits C37 and C34, thereby promoting DNA opening. The unwound DNA directly contacts both sides of the Pol III cleft. Topologically, the Pol III PIC resembles the Pol II PIC, whereas the Pol I PIC is more divergent. The structures presented unravel the molecular mechanisms underlying the first steps of Pol III transcription and also the general conserved mechanisms of gene transcription initiation.

    A user's guide to the Encyclopedia of DNA elements (ENCODE)

    The mission of the Encyclopedia of DNA Elements (ENCODE) Project is to enable the scientific and medical communities to interpret the human genome sequence and apply it to understand human biology and improve health. The ENCODE Consortium is integrating multiple technologies and approaches in a collective effort to discover and define the functional elements encoded in the human genome, including genes, transcripts, and transcriptional regulatory regions, together with their attendant chromatin states and DNA methylation patterns. In the process, standards to ensure high-quality data have been implemented, and novel algorithms have been developed to facilitate analysis. Data and derived results are made available through a freely accessible database. Here we provide an overview of the project and the resources it is generating and illustrate the application of ENCODE data to interpret the human genome.